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Abstract. We study the way in which the abstract structure of a smaU 
overlap monoid is reflected in, and may be algorithmically deduced from, 
a small overlap presentation. We show that every C(2) monoid ad- 
mits an essentially canonical C(2) presentation; by counting canonical 
presentations we obtain asymptotic estimates for the number of non- 
isomorphic monoids admitting a-generator, fc-relation presentations of 
a given length. We demonstrate an algorithm to transform an arbitrary 
presentation for a C(m) monoid (m at least 2) into a canonical C(m) 
presentation, and a solution to the isomorphism problem for C(2) pre- 
sentations. We also find a simple combinatorial condition on a C(4) 
presentation which is necessary and sufficient for the monoid presented 
to be left cancellative. We apply this to obtain algorithms to decide if a 
given C(4) monoid is left cancellative, right cancellative or cancellative, 
and to show that cancellativity properties are asymptotically visible in 
the sense of generic-case complexity. 



1. Introduction 

Small overlap conditions are natural combinatorial conditions on semi- 
group and monoid presentations, which serve to limit the complexity of 
derivation sequences between equivalent words. First studied by Remmers 
[T2I [T3] in the 1970's, they are the natural semigroup-theoretic analogue 
of the small cancellation conditions extensively used in combinatorial and 
geometric group theory [T^. Like word hyperbolicity for groups [TT], small 
overlap properties are generic for monoid presentations with a fixed number 
of generators and relations, in the sense that a randomly chosen a-generator, 
A;-relation presentation of size n satisfies any given small overlap condition 
with probability which approaches 1 as n increases [6]. 

Remmers' original work showed that monoids with presentations satisfy- 
ing the condition C(3) have decidable word problem; an accessible account 
of this and related results can be found in [4j . Recent research of the author 
[71 El E] has shown that monoids with presentations satisfying the slightly 
stronger condition C(4) have linear time solvable word problem, and a reg- 
ular language of linear-time computable normal forms, and are also rational 
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(in the sense of Sakarovitch [14]), asynchronous automatic and word hyper- 
bohc (in the sense of Duncan and Oilman ^ ) . 

This paper is devoted to the question of how abstract algebraic properties 
(by which we mean isomorphism invariants) of small overlap monoids are 
reflected in, and can be algorithmically deduced from, their presentations. 
Section [2] briefly recalls some key definitions and results from the theory of 
small overlap presentations. In Section [3] we consider isomorphisms between 
small overlap monoids. We show that every isomorphism type of small 
overlap monoids has a canonical small overlap presentation, and that this 
presentation can be effectively computed from any other presentation for the 
monoid. It follows that the isomorphism problem for small overlap monoids 
is decidable. By counting canonical presentations, we obtain asymptotic 
estimates for the number of isomorphism types of a-generator A;-relation 
semigroups of a given size, as a function of the positive integers a and k. 

In Section H] we turn our attention to cancellativity properties. By apply- 
ing results from [3 E] , we are able to give elementary combinatorial charac- 
terisations of those C(4) presentations which present left cancellative, right 
cancellative and cancellative monoids. Since these properties of presenta- 
tions can be easily tested, it follows that one can efficiently decide whether 
a given C(4) presentation presents a left, right or two-sided cancellative 
monoid. We deduce also that left cancellativity, right cancellativity and 
cancellativity are asymptotically visible properties of finite presentations, in 
the sense that the proportion of ^/-generated, fe-relation monoid presenta- 
tions of size n which present monoids with any of these properties converges 
to a limit strictly between and 1, as n increases. 

2. Small Overlap Monoids 

In this section we recap the definitions and some key results concerning 
small overlap monoids. We begin by recalling the basic definitions of combi- 
natorial semigroup theory, chiefly in order to fix notation and terminology. 
Let be an alphabet, or set of symbols. A word over ^ is a finite sequence 
of zero or more symbols from £/. The free monoid si* is the set of all words 
over si , with multiplication defined by concatenation of sequences. The 
unique empty word containing no symbols is denoted e; it forms the identity 
element in s/* . A monoid presentation consists of a pair {s/ \ where 
si is an alphabet, and is a binary relation on the free monoid s/*. The 
presentation is called finite if both si and ^ are finite. The elements of the 
binary relation which are pairs from si* x si*, are rather confusingly 
called the relations of the presentation. A relation with both sides the same 
(that is, one of the form {u,u) for some u G si*) is termed trivial. 

The maximum relation length of the presentation is the length of the 
longest word appearing as one side of a relation in ^ (or if there are no 
relations), while the sum relation length is the total length of all the words 
forming sides of relations in 1%. 

We say that a word u E i/* is obtained from a word v G ^* by an 
application of a relation from ^ ii u = pxq and v = pyq for some words 
p, q,x,y G si* such that (x, y) £ ^ oi (y, x) G We define a binary 
relation (called just = where there is no ambiguity as to the presentation) 
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on s^* by u v '\i v can be obtained by u by a finite sequence of zero 
or more applications of relations from Si. In fact is a congruence on 
that is, an equivalence relation compatible with the multiplication. We 
denote by [u]^ (or just [tt]) the equivalence class of the word u G s^* . The 
equivalence classes form a monoid with multiplication well-defined by 

this is called the monoid 'presented by the presentation. 

We say that a word p is a possible prefix of u if there exists a (possibly 
empty) word w with pw = u, that is, if the element represented by u lies in 
the right ideal generated by the element represented by p. The empty word 
is denoted e. 

A relation word is a word which occurs as one side of a relation in the 
presentation. A piece is a word in the generators which occurs as a factor in 
sides of two distinct relation words, or in two different (possibly overlapping) 
places within one side of a relation word. By convention, the empty word 
is always a piece. We say that a presentation is C(n), where n is a positive 
integer, if no relation word can be written as the product of strictly fewer 
than n pieces. Thus for each n, C(n + 1) is a stronger condition than C{n). 

Notice that it is permissible for the same relation word to appear twice or 
more in a C(n) presentation, since by doing so it does not become a factor 
of two distinct relation words. We say that a presentation is strongly C{n) 
if it is C(n) and additionally has no repeated relation words. The condition 
we have called C{n) is that used in [H [121 [H] a-^d was called weakly C{n) 
in [7], while the condition we have called strongly C{n) was called C(n) in 
[Uinill]. See [7] for detailed explanation of the relationship between the two 
definitions. 

We say that a relation word i? is a complement of a relation word R if 
there are relation words R = Ri, R2, ■ ■ ■ , Rn = R such that either [Ri, Ri+i) 
or (i?j_|_i, i?j) is a relation in the presentation for 1 < i < n. We say that 
R is a proper complement of R if, in addition, R ^ R. Abusing notation 
and terminology slightly, if R = XhYhZr and R = X-^-^Z^ then we write 
= X-j^, XjiYr = X-j^Y-j^ and so forth. We say that Xji is a complement 
of X^, and XpiYji is a complement of X/jYr. 

Now let (iz/ I S^) be a C(3) presentation. Recall that a relation prefix 
of a word is a prefix which admits a (necessarily unique, as a consequence 
of the small overlap condition) factorisation of the form aXY where X 
and Y are the maximal piece prefix and middle word respectively of some 
relation word XYZ. An overlap prefix ( of length n) oi a word u is a relation 
prefix which admits an (again necessarily unique) factorisation of the form 
bXiYlX2Y^ . . . XnYn where 

• n > 1; 

• bXiY(X2Y2 . . . XnYn has no factor of the form XqYq, where Xq and 
Yq are the maximal piece prefix and middle word respectively of some 
relation word, beginning before the end of the prefix 6; 

• for each 1 < i < n, Ri = XiYiZi is a relation word with Xi and Zi 
the maximal piece prefix and suffix respectively; and 

• for each 1 < i < n, 1^' is a proper, non-empty prefix of YJ. 
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Notice that if a word has a relation prefix, then the shortest such must be 
an overlap prefix. A relation prefix aXY of a word u is called clean if u 
does not have a prefix 

aXY'XiYi 

where Xi and Yi are the maximal piece prefix and middle word respectively 
of some relation word, and Y' is a proper, non-empty prefix of Y. 
We recall some key technical results about C(n) presentations. 

Proposition 2.1 ([7J Proposition 3). Suppose a word u has an overlap 
prefix wXY and that u = wXYu" . Then u = v if and only if v = wv' where 
v' = XYu". 

Lemma 2.2 ([7| Lemma 3). Let {s^ \ he a C(4) presentation. Suppose 
u = XYu' where XY is a clean overlap prefix of u. Then u = v if and only 
if one of the following mutually exclusive conditions holds: 

(1) u = XYZu" and v = XYZv" and Zu" = Zv" for some complement 
Z ofZ; 

(2) u = XYu' , V = XYv' , and Z fails to he a prefix of at least one of u' 
and v' , and u' = v' ; 

(3) u = XYZu" , V = XYZv" for some uniquely determined proper 
complement XY Z of XY Z , and Zu" = Zv" for some complement 
ZofZ; 

(4) u = XYu' , V = XY Zv" for some uniquely determined proper com- 
plement XY Z of XY Z hut Z is not a prefix of u' and u' = Zv"; 

(5) u = XYZu" , V = XYv' for some uniquely determined proper com- 
plement XY Z of XY Z , hut Z is not a prefix of v' and Zu" = v' ; 

(6) u = XYu' , V = XYv' for some uniquely determined proper comple- 
ment XY Z of XY Z , Z is not a prefix of u' and Z is not a prefix of 
v' , hut Z = z\z, Z = ziz, u! = ziu" , v' = Z2v" where u" = v" and z 
is the maximal common suffix of Z and Z, z is non-empty, and z is 
a possihle prefix of u" . 

Proposition 2.3 ([7J Corollary 1). Let {^^ \ ^) he a C(3) presentation. Lf 
a word u has no clean overlap prefix, then it contains no relation word as a 
factor, and so if u = v then u = v. 

3. Minimal Presentations and Isomorphisms 

Our main aim in this section is to show that every small overlap monoid 
has a canonical small overlap presentation, and that this presentation can 
be effectively computed from any other presentation for the monoid. From 
this we are able to demonstrate a solution to the isomorphism problem for 
monoid presentations satisfying the condition C(2). Some of the essential 
ideas in this section were prefigured in work of Jackson although the 
main results and applications given here are new. 

We begin by introducing some terminology. A presentation (jz/ | ^) is 
called an equivalence presentation if the set ^ of relations is an equiva- 
lence relation on the set of relation words. The equivalence closure of a 
presentation | ^) is the presentation {£/ \ ,y) where J/' is the reflexive, 
symmetric, transitive closure of Clearly, {s^ \ .5^) presents the same 
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monoid as (i?/ | Notice also that, since we require the set of relations 
to be an equivalence relation only on the set of relation words, and not on 
the whole free monoid =e/*, the equivalence closure of a finite presentation 
is still finite, and can be easily computed. It has the same set of relation 
words as the original presentation. In particular, it satisfies C(m) for any 
m > 1 if and only if the original presentation satisfies C{m). 

Recall that if {a/ | is a monoid presentation then a generator a £ J2f 
is called redundant if it is equivalent to a product of (zero or more) other 
generators, or equivalently, if the monoid presented by the presentation is 
generated by the subset ^ \ {a}. We call a presentation generator-minimal 
if it has no redundant generators. Notice in particular that, in a generator- 
minimal presentation, no two generators represent the same element. A 
non-identity element s of the monoid presented is called indecomposable (or 
by some authors, an atom) if whenever x, y are elements of the monoid 
such that xy = s we have x = 1 or y = 1. A generator a G ^ is called 
indecomposable if [a]^ is indecomposable. Notice that a non-identity inde- 
composable element of a monoid must belong to every generating set for the 
monoid. 

Proposition 3.1. Let \ ^) be a C(2) presentation and u a relation 

word. Then for any v € s^* we have u = v if and only if there exists 
n > 1 and words u = ui,U2, ■ ■ ■ ,Un = v such that for 1 < i < n either 
(ui, Ui+i) or {ui+i,Ui) e ^. 

Proof. One implication is immediate from the definitions. For the converse, 
observe that since the presentation satisfies C(2), no relation word contains 
another relation word as a proper factor. It follows that the only relations 
which can be applied to the word u are relations of the form (u, u') or (u', u). 
But now u' is also a relation word, so a simple inductive argument shows 
that any rewriting sequence taking u to v must have the given form. □ 

Proposition 3.2. Let {£/ | be a C(2) presentation and u a relation 
word. Then for any proper factor u' of u and any v G =!2/* we have u' = v if 
and only if u' = v. 

Proof. Since the presentation satisfies C(2), no relation word contains an- 
other relation word as a proper factor. It follows that u' contains no relation 
word as a factor, and so no relation can be applied to it. □ 

Proposition 3.3. Let {si \ be a monoid presentation satisfying C{2), 
and let a E: s/. Then a is either indecomposable or redundant. 

Proof. Since the presentation satisfies C(l), the empty word is not a relation 
word. If the presentation contains no non-trivial relations of the form {w, a) 
or (a, w) then no relations arc applicable to a, and so a is indecomposable. 
On the other hand, suppose the presentation does contain a non-trivial 
relation of the form {w, a) or (a, w). Then a is a relation word, which since 
the presentation satisfies C(2) means that a is not a piece, and so a cannot 
feature in the word w. Thus, the given relation can be used to rewrite a as 
a product of the other generators, which means that a is redundant. □ 
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Proposition 3.4. Letm, > 2. Every C{m) m,onoid has a generator-minimal 
C{m) equivalence presentation, which can be effectively computed starting 
from any C (m) presentation for the monoid. 

Proof. Let {s^ | ^) be a C(m) presentation for M, and suppose that this 
presentation is not generator-minimal. Let a G =c/ be a redundant generator. 
Clearly some non-trivial relation in M can be applied to the word a; since 
the presentation satisfies C(l) neither side of this relation can be the empty 
word, so this relation must have the form (a, w) or {w, a) for some w G 
with w ^ a. 

Let d denote the set of all words w such that (a, w) or (w, a) is a relation 
and w ^ a. Since the presentation satisfies C(2) and a is a relation word, a 
cannot be a factor of any w E a. Let 

^ = ^\{a} 

and 

y = {^\{{a,w), {w,a) I w G £/*}) U {{u,v) \u,v e a}. 

Consider now the presentation \ Since every relation word in 

I ^) is also a relation word in {^Z | ^), it is clear that {3^ \ y) is a 
Cirri) presentation with strictly fewer generators than (iz/ | M). We claim 
that I .5^) is a presentation for the same monoid as (iz/ | Si)^ Indeed, 
let i : — > .2/ be the inclusion map. Then i extends to a morphism 
a : ^* sZ* of free monoids, and since every relation in 5^ is satisfied in 
I ^), this induces a morphism a : {SS \ y) (is/ | M) well-defined by 
(t([w]^) = [a{w)]gi. Now the image of a contains [w]^ = [a],^ and also \b]^ 
for every h ^ = \ {a}, so a must be surjective. Now suppose tt, f G 3§* 
are such that (7(['u]^) = a{[v\y). Then a{u) =ag a{v), so there is a sequence 
of words 

u = a{u) = yo, yi, . . . , y„ = a{v) = v £ £/* 

such that each yj+i can be obtained from y^ by an application of a single 
relation in For each i, let Zi be the word obtained from by replacing 
any occurences of the letter a with the word w. Since a is not a proper 
factor of any relation word in | it follows from the definition of y 
that each yj+i can be obtained from yj by an application of a single relation 
from y. But yo = u and y„ = so we deduce that [u]y = [v]y. Thus, a 
is injective, and so is an isomorphism. 

We have shown that given a non-generator-minimal C{m) presentation, 
there always exists a C(m) presentation for the same monoid with strictly 
fewer generators; it follows that a C{m) presentation with the fewest pos- 
sible number of generators must be generator-minimal. Moreover, given a 
non-generator-minimal C(m) presentation, one may identify a redundant 
generator by seeking a relation of the form (a, w) or a) for some a G 
and w G iz/*, and then use the approach described above to effectively 
compute a C{m) presentation with strictly fewer generators. By iteration, 
one may thus compute a generator-minimal C (m) presentation for the same 
monoid. 

Finally, to obtain a generator-minimal C(m) equivalence presentation, it 
suffices to take a generator-minimal C{m) presentation and compute the 
equivalence closure of the set of relations. By our remarks at the beginning 
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of the section, this is a C{m) equivalence presentation for the same monoid, 
and is clearly still generator-minimal since it has the same generating set. □ 

Corollary 3.5. In any monoid admitting a C(2) presentation, the set of 
non-identity indecomposable elements forms a generating set which is con- 
tained in every generating set for the monoid. 

Proof. By Proposition 13.41 such a monoid has a generator-minimal C(2) 
presentation, that is, one in which no generators are redundant. By Propo- 
sition[3]3l the generators in this presentation must be indecomposable, which 
shows that the set of all non-identity indecomposable elements forms a gen- 
erating set. Finally, we have already observed that every non- identity inde- 
composable element must lie in every generating set for the monoid. □ 

We now introduce some more terminology. Let (j# | ^) and (i^ | ,y) be 
presentations. An isomorphism a : £/* ^* of free monoids is called an 
inclusion of {£/ \ ^) into [SS \ 5^) if for every relation 

(n,u) e ^ 

there is a relation 

We say that (iz/ | S^) is a sub-presentation of \ 5^) if there exists an 
inclusion of (^Z | ^) into {SS \ S^). An isomorphism between two presenta- 
tions is an inclusion whose inverse is also an inclusion, and two presentations 
are called isomorphic if there is an isomorphism between them. Recall that, 
since a free monoid has a unique free generating set, isomorphisms between 
free monoids are in a natural one-one correspondence with bijections be- 
tween the corresponding free generating sets. While conceptually inclusions 
should be thought of as isomorphisms between free monoids, computation- 
ally it is more practical to work with the corresponding bijections. Notice 
that it is a completely routine matter to determine, given two finite presen- 
tations and a bijection between the alphabets, whether the bijection extends 
to an inclusion (and hence also whether it extends to an isomorphism). 

The following theorem says, informally, that any generator-minimal equiv- 
alence presentation contains a copy of every generator-minimal C(2) presen- 
tation for the same monoid. 

Theorem 3.6. Let {£/ \ ^) be a generator-minimal C(2) presentation 
and {SS I S^) be any generator-minimal equivalence presentation. Let a : 
{^Z I ^) I =5^) be an isomorphism between the monoids presented. 

Then there is a unique morphism of free monoids a : £/* — > such that 
[a{a)]_y = a{[a\^) for all a G £/, and this morphism is an inclusion of 
{£/ I ^) into 

Proof. Since {s^ \ ^) is generator-minimal, the formal generators must rep- 
resent distinct elements of the monoid and, by Proposition 13.31 those el- 
ements are indecomposable. Since a is an isomorphism, we deduce that 
the images of these generators are indecomposable, which by Corollarv 13.51 
means that they lie in the set of elements represented by the formal gen- 
erators in It follows that we can define a : £/* ^* by chosing for 
each a G £/ an element a{a) E ^ such that [(T(a)]^' = (T([a]^), and then 
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extending to a morphism of jz/* using the universal property of free monoids. 
Moreover, since {SS \ S^) is generator-minimal, the generators in | 5^) 
represent distinct elements, so these choices are unique and a is the unique 
function with the given property. 

Now (T is injective, so a separates £^ ^ and since ^ is free it follows that 
a is injective. Moreover, a is an isomorphism, so the images of the gener- 
ating set for {s^ I ^) comprise a generating set for {3S \ S^). Since they 
are contained in ^ and | 5^) is generator-minimal, this means that a 
maps bijectively onto and hence is an isomorphism between s^* and 
This isomorphism clearly has the property that u =tig v and only if 
a{u) =y (t(v). 

Now suppose {x, y) G To show that a is an inclusion, and hence 
complete the proof of the theorem, we need to show that (o'(x), (t(?/)) € y. 
By the property of a described above we certainly have that (j{x) =y (j{y) 
in the monoid {^^ \ y) . This means that we can choose a sequence of words 

lio = <t(x), ui, U2, Un = a{y) G 3S* 

such that each Uj+i can be obtained from Ui by an application of a relation 
from y . This by definition means that for 1 < i < n we may choose 
Pi,Qi,Q'i,i"i G ^* such that Ui = PiQiVi, Uj+i = Pig-r^, qi / g- and (using the 
fact that y is symmetric) also {qi^q^) G y. 

For each i, let Wi = d~^{ui) G £/*. Then using again the property above 
of a we have that wi, . . . , Wn are all equivalent in {£/ \ to the relation 
word wq = a~^{uQ) = x. By Proposition 13. H this means in particular that 
every Wi is a relation word in the presentation | M). 

We claim now that Ui = q-i for every i. Indeed, suppose for a contradiction 
that there is some i for which qi ^ Ui. We know that qi =y q[ and qi ^ q[ 
(since the rewrite is non-trivial), which by the property of a noted above 
means that ^~^{qi) CF~^{q[) and CF~'^{qi) 7^ a~^{q[). But (T~^((7i) is a 
proper factor of Wi = a~^{ui), which is a relation word, so by Proposition [32] 
we must have (T~^((j'i) = a~^{q[) and hence qi = q[ which gives the desired 
contradiction and completes the proof of the claim that Ui = qi for every i. 

It follows immediately that q[ = Uj+i = qi+i- But now (?Xj,ttj+i) = 
(<?2) Q-'d ^ every i, which since y is transitive, means that (ito, Un) G y 

as required. □ 

Corollary 3.7. Two generator-minimal C{2) equivalence presentations present 
isomorphic monoids if and only if they are isomorphic. 

Proof. It is obvious that isomorphic presentations present isomorphic monoids. 

Conversely, let | ^) and \ y) be generator-minimal C(2) equiv- 
alence presentations, and let ai : \ ^) — > | y) be an isomorphism 
between the monoids presented, and ai : \ y) {£/ \ SS) be its inverse. 
Let (Ti : si* — > SS* and ui : SS* £/* be the inclusions given by Theo- 
rem 13.61 To show that the presentations are isomorphic, it will suffice to 
show that the inclusions ai and (T2 are mutually inverse. Now we know from 
Theorem 13.61 that (T2 is the unique isomorphism from to s/* satisfying 

for all 6 G so it will suffice to show that a7^ also satisfies this condition. 
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By the properties of ai guaranteed by Theorem 13.61 we have 

[<5"i(a)].y' = ai{[a\^) 
for all a G . Applying a^^ = (T2 to both sides we obtain 

<y2{[^i{a)].y) = o-2(o-i([a]^)) = [a]^. 
Now for any b G letting a = di^ih) in this expression yields 

a2{[h]y) = [a^\b)y 
as required. □ 

As a straightforward consequence, we obtain the fact that the isomor- 
phism problem for C(2) monoids is algorithmically solvable. 

Theorem 3.8. There is an algorithm which, given as input two C(2) pre- 
sentations, decides whether the monoids presented are isomorphic. 

Proof. Given two C(2) presentations, by Proposition 13.41 we may compute 
generator-minimal C(2) equivalence presentations for the same monoids. 
Now by Corollary 13.71 it suffices to check if the resulting presentations are 
themselves isomorphic. This can clearly be done, for example by enumerat- 
ing all bijections between the generating sets and checking if any extend to 
isomorphisms. □ 

The following theorem says that a small overlap presentation without re- 
dundant generators will always satisfy the strongest small overlap conditions 
satisfied by any presentation for the same monoid. 

Theorem 3.9. Let m be a positive integer, and {a/ \ M) be a generator- 
minimal C(2) presentation for a monoid which admits a C{m) presentation. 
Then {a/ \ Si) is a C{m) presentation. 

Proof. If m < 2 then the claim is trivial, so suppose m > 3. 

Suppose for a contradiction that {s^ \ ^) does not satisfy C{m). Then 
there is a relation word r G s^* which can be written as a product of strictly 
fewer than m pieces of | ■^), say r = rir2 ■ . .rj where ri, . . . , € £/* 
are pieces and j < m. 

Now the monoid presented by (i?/ | S^) admits a C{m) presentation, so 
by Proposition 13.41 it admits a generator-minimal C{m) equivalence presen- 
tation. Let I 5^) be such a presentation and a : {s^ \ {B§ \ 5^) 
an isomorphism. Let a : =2/* — > S^* be the corresponding morphism of free 
monoids given by Theorem 13. 6i Since a is an inclusion it follows easily 
that a{r) is a relation word in | .y), and o"(ri), . . . ,d'{rj) are pieces of 
I But since a is a morphism, we have 

a{r) = a{ri)a{r2) . ■■CF{rj) 

which contradicts the assumption that | S^) satisfies the condition C{m). 

□ 

We also obtain a corresponding statement for strong small overlap condi- 
tions. 
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Corollary 3.10. Let m be a positive integer, and {£/ \ he a generator- 
minimal strongly C(2) presentation for a monoid which admits a strongly 
C{m) presentation. Then {s^ \ Si.) is a strongly C{m) presentation. 

Proof. By Theorem l3.10l (£/ \ is a C{m) presentation. Since it is strongly 
C(2) it contains no repeated relation words, which means it is strongly 



Corollary 3.11. There is an algorithm which, given as input a monoid pre- 
sentation, finds a generator-minimal C{m) equivalence presentation for the 
same monoid with m > 2 as high as possible, provided such a presentation 
exists (and may not terminate otherwise). 

Proof. If the presentation satisfies C{m) for some m > 2 then by definition 
it satisfies C(2), and by Proposition 13.41 the monoid presented admits a 
generator-minimal C(2) equivalence presentation. By Theorem l3.9l any such 
presentation must also satisfy C{m), so it suffices to find such a presentation. 
It is well known and easy to show (using for example the theory of Tietze 
transformations [2, Section 7.2]) that there is an algorithm which, given 
as input any monoid presentation, recursively enumerates all presentations 
isomorphic to it. So it suffices to do this, checking if each is a generator- 
minimal C(2) equivalence presentation, until we find one which is. □ 

Corollary 3.12. Let m be a positive integer. Then there is no algorithm to 
decide, given as input a monoid presentation, whether the monoid presented 
admits a C{m) presentation. 

Proof. It is well known (see for example Corollary 7.3.8]) that there is 
no algorithm to decide, given a monoid presentation, whether the monoid 
presented is trivial. The trivial monoid admits the empty presentation, 
which is a C{m) presentation for every m. If there were an algorithm to 
decide whether a given monoid admits a C{m) presentation then one could 
decide whether the monoid was trivial, by first checking if it admits a C (m) 
presentation. If not then it cannot be trivial. If so then by Corollary 13.111 
we could compute a C{m) presentation for the monoid, and by Theorem l3.8l 
we could check if it is isomorphic to the trivial monoid. □ 

Using Theorem l3.8l and techniques from [6], we can obtain asymptotics for 
the number of non-isomorphic semigroups admitting a-generator, A;-relation 
presentations of a certain length. Recall that for functions /, 5 : N ^ N we 
say that /(n) is 0{g{n)) [respectively, Q{g{n), 6{g{n))] if /(n) is bounded 
above [respectively, below, above and below] by a linear function of g{n) 
with positive coefficients. 

Theorem 3.13. Let a and k be fixed positive integers. Then there are 
9{a'^n'^^~^) distinct isomorphism types of semigroups admitting a-generator, 
k-relation presentations of sum relation length n. 

Proof. The proof utilises some arguments from [6]; since this result is not 
the main purpose of the present paper we refrain from repeating these in 
detail and instead refer the reader to [6j for a more detailed explanation 
whether appropriate. 




□ 
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For ease of counting, we will first consider ordered presentations, by which 
we mean a pairs {£/ \ ^) where is an alphabet and is a (finite or 
infinite) sequence of pairs of words over £/. As observed in [6], an ordered 
^-generated, /c-relation monoid presentation of sum relation length n is 
uniquely determined by its shape (the sequence of lengths of relation words) 
together with the concatenation in order of the relation words (a word of 
length n). 

Recall (from for example [H Theorem 5.2]) that the number of weak 
compositions of s into r (that is, ordered sequences of r non-negative integers 
summing to s) is given by 

(s + r-1)! 

s\{r-iy. ■ 

Note that if r is fixed and s remains variable then C^(s) is a polynomial of 
degree r — 1 in s. 

Let / : N — > M be a function. The total number of shapes of length n 
is C2^(n). As shown in the proof of [6, Lemma 3.4], the number of shapes 
featuring a block of size f{n) or less is bounded above by 

so the number of such shapes not featuring such a block is bounded below 
by 

Ci,(n) - 2kU(n) + l)Ck-.,(n) = ^-^^^ - 2M/(") + D^^^^^ 



n\{2k-2)l \ 2k -I 
C'2k-i{n) - 2k{f{n) + 1) + l) 

1 2A:(/(n) + 1) + 1 



Now fix an alphabet of size a. By [6, Lemma 3.2] there are at most 
n^a^-f^"^) distinct words of length n which contain a repeated factor of 
length f{n), and so there are at least 

a'' - n^a"-^^") = a" f 1 



a' 

words of length n which do not contain such a repeated factor. Combining 
such a word with a shape with no blocks of length f{n) or less clearly yields 
a presentation in which no relation word appears as a factor of any other 
relation word, that is a strongly C(2) presentation. Dividing by k\ to allow 
for reordering the k relations, we have at least 

(.--I,) «„(^ 

^-generated fc-relation generator-minimal strongly C(2) unordered presen- 
tations of length n. 

Since a presentation presents the same monoid as its equivalence closure 
it follows by Corollary 13.71 that two such presentations present isomorphic 
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monoids if and only their equivalence closures are isomorphic presentations. 
Since a strongly C(2) presentation must be transitive and irreflexive, the 
only difference between two strongly C(2) presentations with the same equiv- 
alence closure is in the order of each relation, so at most 2'^ distinct strongly 
C{2) presentations have the same equivalence closure. And since an iso- 
morphism of presentations over ^ is determined by a bijection on £/, each 
such equivalence closure is isomorphic to at most a! others. Thus, at most 
2^a\ distinct strongly C(2) presentations present isomorphic monoids, so the 
strongly C(2) presentations found above must present at least 

1 „n ^ ^/ . ^ 1 2/c(/(n) + 1) + 1 



n 



distinct isomorphism types of monoids. Setting /(n) = 3 log^ n this becomes 
at least 

1 -» C^,_,(„) „ ^ 1 2K31og„„ + l) + l 



alk\2'' \ nj ' \2k - 1 n 

distinct isomorphism types. Now the factor 

1 



a!A;!2^ 

is a positive constant, while the factors 



l_iUnd^ 1 2t(31og„„ + l) + l 



n J \2k — 1 n 

are eventually bounded below by positive constants and C'2^_]^(n) is a poly- 
nomial of degree 2k — 2 in n. Thus, the number of distinct isomorphism 
types is ^}{a^n'^^~^). 

Finally, the number of isomorphism types of semigroups admitting a- 
generator, /c-relation presentations of length n is clearly bounded above by 
the number of /c-generator presentations over a fixed alphabet of size a. This 
is bounded above by the number of words of length n times the number 
of weak compositions of n into 2k, that is a^O^j^^n). Since C2^{n) is a 
polynomial of degree 2k — 1 in re, it follows that the number of presentations, 
and hence the number of isomorphism types, is 0{a""n?'^~^). □ 



A closer analysis of the combinatorics in the proof of Theorem 13.131 would 
of course yield more precise information about the number of isomorphism 
classes, as well as asymptotics applicable when a and k are permitted to 
vary. 

4. Cancellativity 

In this section we investigate the conditions under which the monoid pre- 
sented by a C(4) presentation is left cancellative, right cancellative or (two- 
sided) cancellative. It transpires that these properties can be characterised 
by very simple and natural conditions on the presentation, and from this it 
follows that one can check in linear time whether a given C(4) monoid has 
any of these properties. Interestingly, it follows also that cancellativity prop- 
erties are asymptotically visible properties of finite monoid presentations, in 
the sense that the probability that an =K/-generated, /c-relation presentation 
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of length selected uniformly at random presents a (left, right or two-sided) 
cancellative monoid converges to a value strictly between and 1, as the 
size of the presentation increases. 

We begin with a technical definition. Let {s^ \ ^) be a C(4) presentation. 
We define a function p : s^* — > Z as follows. For w G s^* define p(w) = — 1 
if w has no clean overlap prefix; otherwise we define p{yS) to be the length 
of the suffix of w following the clean overlap prefix. Notice that p takes 
integer values greater than or equal to —1; we shall use it as an induction 
parameter. 

Lemma 4.1. If w = XYZw' where XY is a clean overlap prefix and p is 
a piece then p{pw') < p{w) 

Proof. By definition we have p{w) = \Zw'\ > 0. If pw' has no clean overlap 
prefix then again by definition we have p{pw') = — 1 and we are done. Oth- 
erwise, suppose pw' has a clean overlap prefix aX'Y' , say pw' = aX'Y'w" . 
Then p{pw') = \w"\. Now X' is the maximum piece prefix of a relation word 
X'Y' Z' , so X'Y' is not a piece. In particular X'Y' cannot be a factor of p, 
so we must have that aX'Y' is strictly longer than p, and so \w"\ < \w'\. 
Hence, 

p{pw') = \w"\ < \w'\ < \Zw'\ = p{w) 
as required. □ 

We are now ready to prove our main result of this section, which charac- 
terises C(4) equivalence presentations of left cancellative monoids. 

Theorem 4.2. Let {s^ \ M) he a C(4) equivalence presentation. Then the 
monoid presented is left cancellative if and only if Si contains no relation of 
the form {ar, as) where a (z and r, s G £/* with r ^ s. 

Proof. Suppose first that S? contains a relation (ar, as) for some letter a (z £/ 
and words r,s (z £/* with r ^ s. Since the presentation satisfies C(2) and 
r is a proper factor of a relation word, it follows by Proposition 13.21 that 
r ^ s. But by definition we have ar = as, so the monoid presented is not 
left cancellative. 

Conversely, suppose {£/ \ contains no relations of the given form, and 
suppose for a contradiction that the monoid presented by | is not 
left cancellative. Then the generators are not left cancellable, so there exists 
a generator a ^ and words u,v (z s^* such that au = av but u ^ v. Let 
a, u and v be chosen such that these conditions hold and p[au) is as small 
as possible. 

First, note that if au has no clean overlap prefix then by Proposition 12.31 
we have au = av as words, so that u = v which gives a contradiction. Next, 
suppose that au has a clean overlap prefix wXY with w non-empty, say 
au = wXYu" . Then by Proposition 12 . II we have av = wv' where v' = XYu". 
But now w = aw' for some word w and we have u = w' XYu" = w'v' = v, 
again giving a contradiction. 

There remains only the case that au has a clean overlap prefix of the 
form XY. Then one of the six mutually exclusive conditions of Lemma 12.21 
hold in respect of au and av. Now conditions (3)-(6) are impossible because 
au and av begin with the same letter, while XY and the complement XY 
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cannot. Indeed, if they did begin with the same letter then so would the 
distinct relation words XYZ and XYZ. But by symmetry and transitivity 
of the presentation we have (XYZ, XYZ) E so this would contradict the 
assumption on the presentation. 

If condition (2) holds we have au = XYu', av = XYv' where u' = v' . But 
Y is non-empty, so we may write XY = ar for some r € ^* , whereupon 
u = ru' = rv' = V again giving a contradiction. 

If condition (1) holds then au = XYZu" and av = XYZv" and Zu" = 
Zv" for some complement Z of Z. We claim that the existence of any piece 
q such that qu" = qv" implies that u" = v"; the proof of this claim is by 
induction on the length of q. For the base case, if q is the empty word then 
certainly we have u" = qu" = qv" = v" . Now suppose for induction that q 
is not the empty word, that qu" = qv" and that the claim holds for shorter 
q. Write q = bq' where b (z ^ and q' £ £/* . Then by Lemma l4.ll we have 

p{bq'u") = p{qu") < p{XYZu") = p{au) 

so by the minimality assumption on a, n and v we have q'u" = q'v" , which 
by the inductive hypothesis implies that u" = v" . This completes the proof 
of the claim. Now returning to the main proof, setting q = Z we deduce 
that u" = v" , and an argument similar to that in case (2) completes the 
proof that u = v, once again establishing the required contradiction. □ 

The combinatorial condition given by Theorem 14 . 2 1 can clearly be checked 
in polynomial time (more precisely, linear time in the RAM model of com- 
putation) so we have: 

Corollary 4.3. There is an algorithm which, given as input a C(4) equiva- 
lence presentation, decides in polynomial time whether the monoid presented 
is left cancellative, right cancellative and/or cancellative. 

Corollary 4.4. Let \ M) he a strongly C(4) presentation. Then the 
monoid presented is left cancellative if and only if M contains no relation of 
the form (ar, as) where a G -s/ and r,s G -s/* . 

Proof. Since a strongly C(4) presentation contains no repeated relation 
words it is already transitive. Hence its equivalence closure {s^ \ 5^) can be 
obtained simply by adding all relations of the form {u,u), {v,v) and {v,u) 
where (n, v) is a relation in Now {£/ \ 5^) presents the same monoid 
as (=2/ I , so by Theorem 14. 2^ ( j?/ | Si.) is left cancellative exactly if 5^ 
contains no relation of the form (ar, as) with a £ and r ^ s. Clearly, this 
is true exactly if M contains no relation of the form (ar, as) with a € ^ and 
r,sG^*. □ 

Our results in this section also have a consequence for the theory of generic 
properties in monoids and semigroups [6j. In the terminology of generic- 
case complexity, the following theorem says that left, right and two-sided 
cancellativity are aymptotically visible properties of j4-generated, ^-relation 
monoids. Such properties are of interest since they seem to be rather rare, 
with most abstract properties of groups and semigroups tending to be ei- 
ther generic or negligible amongst finitely presented examples. To avoid 
defining large amounts of terminology for a single use, we state the result in 
elementary combinatorial terms. 
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Theorem 4.5. Let a/ be an alphabet with > 2. Then the proportion 
-generated, k-relation monoid presentations of sum relation length n (or 
of maximum relation length n ) which present left cancellative [right cancella- 
tive] monoids approaches 

- V 



as n tends in oo. The proportion of £/ -generated, k-relation monoid presen- 
tations of sum relation length n (or of maximum relation length n ) which 
present cancellative monoids approaches 



as n tends in oo. In particular, for \.s^\ > 2, left [right] cancellativity is an 
asymptotically visible property of A-generated k-relation presentations. 

Proof. As with the proof of Theorem 13.13^ we make use of some results and 
arguments from [6j . Rather than repeating these at length, we instead refer 
the interested reader to that paper. 

By Theorem 3.5] (or [6^ Theorem 3.8] when the maximum relation 
length is considered), the proportion of ordered presentations which are 
not strongly C(4) approaches as n increases. Corollary 14.41 tells us that a 
strongly C (4) presentation presents a left [right] cancellative monoid exactly 
if it contains no relation of the form ar = as [ra = sa\ where a G and 
r,s € £/* , so to find the limit of the proportion of left [right] cancellative 
monoid, it suffices to compute the proportion of presentations satisfying this 
condition. 

As described in the proof of Theorem 13.131 an ordered presentation is 
uniquely determined by its shape (a weak composition of n into 2k) and 
the concatenation in order of its relation words (a word over of length 
n). Now for any shape not featuring a block of length 0, the proportion 
of words which do not yield two relation words beginning [ending] with the 
same letter is clearly exactly ((|=k/| — 1)/|=2/|)'^. Similarly, for any shape not 
featuring a block of length or 1 (so that the first and last positions of each 
relation word are distinct), the proportion of words which do not yield two 
relation words beginning with the same letter or ending with the same letter 
is exactly ((K| - 

Now by [HI Lemma 3.4] (or Lemma 3.6] when the maximum relation 
length is considered), the proportion of J2/-generated, /c-relation ordered 
presentations of length n whose shape features a relation word of length 
or 1 tends to zero as n tends to infinity. It follows that we may ignore 
presentations with these shapes, and conclude that the proportion of ^- 
generated, fc-relation ordered presentations of sum or relation length n which 
are left cancellative, right cancellative and cancellative approaches 

respectively as n tends to oo. 

Finally, we turn our attention to unordered presentations. By [6l Theo- 
rem 4.10] the proportion of unordered presentations of length n which fail 
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to be strongly C(4) approaches at n tends to oo, so it suffices to com- 
pute the proportion of strongly C(4) unordered presentations which present 
left cancellative [right cancellative, cancellative] monoids. However, since 
a strongly C(4) presentation contains no repeated relation words, each k- 
relation unordered strongly C(4) presentation corresponds to precisely k\ 
distinct ordered strongly C(4) presentations. Since moreover every ordered 
strongly C(4) presentation arises in this way, it follows that the required 
proportions are the same for unordered presentations as for ordered presen- 
tations. □ 

In fact the argument in the last paragraph of the proof of Theorem 14.51 
establishes the following fact, which generalises [6l Theorem 4.10]. 

Theorem 4.6. Let ^ he any class of abstract monoids, si an alphabet with 
\si\ > 2 and k a positive integer. Then the proportion of s/ -generated k- 
relation ordered monoid presentations of sum [maximum] relation length n 
which present monoids in ^ converges with the proportion of si -generated 
k-relation monoid presentations of sum [maximum] relation length n which 
present monoids in 'if as n tends to oo. 
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